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Site-Specif.ir Rscomhination in Eukarvotes 
and Consr.ructs useful Thsrefor 

FIELD OF THE INVENTION 

The present invention relates to methods for 
manipulating chromosomal sequences in cells by site- 
specific recombination promoted by recombinases. In a 
5 particular aspect, the present invention relates to methods 
for producing embryonic stem cells bearing nucleic acid 
sequences that have been rearranged by a site-specific 
recombinase expressed from a construct controlled by a 
tissue-specific promoter (e.g., a germline specific 
10 promoter). In another aspect, the present invention 
•relates to methods for producing embryonic stem cells 
bearing nucleic acid sequences that have been rearranged by 
a site-specific recombinase expressed from a construct 
controlled by a conditional promoter. 

15 BACKGROUND OF THF INVENTION 

The analysis of gene function has increasingly come to 
require the production of subtle, tissue-specific, and 
conditional mutations in animals and plants. Although 
there are a number of methods for engineering subtle 
20 mutations in embryonic stem (ES) cells (Hasty et al . (1991) 
Nature 350:243-246, Askew et al . (1993) Mol Cell Biol 
13:4115-4124), the use of site-specific recombinases to 
remove the selectable marker that permits isolation of 
homologously recombined ES cell clones has become 
25 increasingly prevalent (Kitamoto et al . (1996) Biochem 
Biophys Res Cowwun 222 :7 42-747 , Fiering et al . (1993) Proc 
Natl Acad Sci USA 90:8469-8473, Schwenk et al . (1995) 
Nucleic Acids Res 23:5080-5081; Gu et al . (1993) Cell 
73:1155-1164; Sailer et al . (1996) Taniguchi Symposia on 
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Brain Science, eds . Nakanishi et al. (Japan scientific 
press) . PP- 89-98) . 

Site-specific recombinases represent the best method 
for creating' tissue-specific and conditional stations in 
5 animals and plants, being employed first to remove the 
selectable marker to create a functionally wHd-type 
allele, and then to inactivate the allele mosaxcally in 
animals and plants by removing some essential -mponent ,n 
a tissue-specific or conditional manner (Gu et al. (1994, 
10 science 265:103-106,- Kuhn et al . (1995) Scene. 
269 . 1427-1429). Current protocols for using exdssxve 
site-specific recombination to remove selectable markers 
include transiently transfecting ES cell clones ««h . 
recombinase expression vector «=u et al . (1.93, Cell 
IS 73- 1155-1164, . micro injecting fertilized oocytes contains 
che recombinant allele with a recombinase expression vector 

rn „ t al (1996, Biochem Biophys Res Commun 

(Kitamoto et al . ^ 

222 . 742-747; Araki et al . (1995, Proc «atl Acad Sci U 
92-160-164) . or breeding animals and plants containing the 
20 recombinant allele to animals and plants respectively. 
containing a recombinase transgene (schwenk et a . 15 
Nucleic Acids Res 23:5080-5081; Lewandosk. et al . 
curr Biol 7:148- 1 51) . Each of these approaches requires an 
investment of some combination of time, resources and 
25 expertise over that required to generate animals and plants 
with homologously recombined alleles. The most common y 
employed method, the secondary transfection of homologously 
recombined ES cell clones with a recombinase expression 
vector, additionally requires extended culture trme that 
30 may decrease their potential to enter the germane. 

in principle, marker excision would be substantially 
simplified through the use of ES cells containing 
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recombinase nucleic acid constructs that were expressed in 
the germline, but not to an appreciable extent in the ES 
cells themselves or somatic tissues of animals and plants. 
The lack of ES cell expression would mean that targeting 
5 vectors containing selectable markers flanked by 
recombinase target sites could be used to isolate 
homologous recombinants without fear that the marker would 
be excised during culture. Robust recombinase expression 
in gametes would mean that the marker would be excised in 

10 at least some of the progeny of ES cell chimeras. Only a 
single step would be required to isolate subtle mutations 
and, if two different recombinase systems were employed, 
conditional and tissue-specific alleles could be produced 
with similar improvements in efficiency. A 

15 germline-specif ic recombinase nucleic acid construct could 
also be used to deliver recombined target nucleic acid 
constructs to the early embryo (Lewandoski et al . (1997) 
Curr Biol 7:148-151), so long as the recombined target was 
not detrimental to the terminal stages of spermatogenesis. 

20 Previous reports have shown that expression of nucleic 

acid constructs containing the proximal promoter of the 
mouse protamine 1 (mPl) locus is restricted to haploid 
spermatids in mature mice (Peschon et al . (1987) Proc Natl 
Acad Sci USA 84:5316-5319; Behringer et al . (1988) Proc 

25 Natl Acad Sci USA 85:2648-2652), although low levels of 
ectopic expression may occur in some mature tissues 
(Behringer et al . (1988) Proc Natl Acad Sci USA 
85:2648-2652). Inclusion of the mPl promoter does not 
guarantee expression in the male germline, however, for 

30 although nucleic acid constructs containing the mPl 
promoter and the SV40 T-antigen coding sequence were 
transcribed, the message was not translated at detectable 
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levels in spermatids (Behringer et al . (1988) Proc Natl 
Acad Sci USA 85:2648-2652). 

Accordingly, there is a need in the art for methods to 
m odulate expression of recombined target nucleic acid 
sequences in the early embryo. In addition, there is a 
ne ed in the art for tissue-specific and conditional 
recombinatory tools to create transgenic animals and 
plants. These and other needs in the art are addressed by 
the present invention. 

pFT ^ ncQPPTPTTQN ™ ™* INVENTION 

The present invention meets the need in the art for 
modulating expression of recombined target nucleic acid 
sequences to the early embryo. The present invent ion 
further meets the need in the art for tissue-specific and 
5 conditional recombinatory tools to create transgenxc 
animals and plants. Thus, in accordance with the present 
invention, it has been discovered that nucleic acid 
constructs encoding a germline specific promoter 
operatives associated with a recombinase coding sequence 
,0 lead to efficient recombination of a target nucleic add 
construct in the male germline, but not in other tissues. 
This suggests that such nucleic acid constructs could be 
used for the efficient production of embryos bearing 
conditional, genetically lethal alleles. It has 

25 additionally been discovered that ES cell lines generated 
from one of these transgenic lines could be used in 
combination with targeting vectors that contained 
loxP-flanked selectable markers to isolate homologous 
recombinants containing the marker and functional lox. 
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RP.TEF DESfPTPTTON nF THE FIGURES 

Figure 1 illustrates a schematic of P2Bc and P2Br 
alleles. The positions of the PCR primers used (5'P and 
3-P) are indicated on the diagrams of the P2Bc and P2Br 
5 alleles. 

Figure 2 depicts the targeting of the hoxb-l locus in 
ProCre ES cells using a targeting vector that contains a 
loxP-flanked selectable marker. Top, schematic of the 
wild-type hoxb-l locus showing the positions of the two 
exons (open boxes), the position of a 5 ' Nrul site and 
flanking BamHI restriction endonuclease sites, and PCR 
primers (triangles) that amplify a 204 bp product from the 
wild-type allele that includes the Nrul site. Middle, the 
predicted organization of homologously recombined hoxb-l 
15 allele in which a neomycin cassette (NEO) , flanked by loxP 
sites (L) , has been inserted into the Nrul site shown in 
the top diagram. The insertion creates a novel BamHI site 
and the same PCR primers now amplify a 1600 bp product. 
Bottom: the predicted structure of the recombined allele 
shown in the middle panel after Cre-mediated excision of 
the neomycin cassette to leave a single loxP site in place 
of the Nrul site of the wild-type allele. Amplification 
with the same primers now yields a 268 bp product. 

ngT&TT.F.n nF.SCFTPTTON OF THF, INV ENTION 

in accordance with the present invention, there are 
provided nucleic acid constructs comprising a germline- 
specif ic promoter operatively associated with a recombinase 
coding sequence . 



20 



25 
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As used herein, the term -promoter" refers to a 
specific nucleotide sequence recognized by RNA polymerase, 
the enzyme that initiates RNA synthesis. The promoter 
sequence is the site at which transcription can be 
5 specifically initiated under proper conditions. The 
recombinase nucleic acid(s), operatively linked to the 
suitable promoter, is (are) introduced into the cells of a 
suitable host, wherein expression of the recombinase 
nucleic acid(s) is (are) controlled by the promoter. 



10 



Germline-specific promoters contemplated for use in 
the practice of the present invention include the protamine 
1 gene promoter, the protamine 2 gene promoter, the 
spermatid-specific promoter from the c-kit gene (Albanesi 
et al. (1996) Development 122 (4) : 1291-1302 ) , the sperm- 
15 specific promoter from angiotensin-converting enzyme 
(Howard et al . (1993) Mol Cell Biol 13(1) =18-27; Zhou et 
al. (1995) Dev Genet 16 (2) : 201-209) , oocyte specific 
promoter from the ZP1 gene, oocyte specific promoter from 
the ZP2 gene, oocyte specific promoter from the ZP3 gene 
(Schickler et al . (1992) Mol Cell Biol 12 (1) : 120-127) , and 
the like. 



20 



in addition to the above-described germline-specific 
promoters, tissue-specific promoters specific to plants are 
also contemplated for use in the practice of the present 
25 invention, including, for example, the LAT52 gene promoter 
from tomato, the LAT56 gene promoter from tomato, the LAT59 
gene promoter from tomato Eyal et al . (1995) Plant Cell 
7(3) =373-384) , the pollen- specif ic promoter of the Brassica 
S locus glycoprotein gene (Dzelzkalns et al . (1993) Plant 
30 Cell 5(8):855-863), the pollen- specif ic promoter of the 
NTP303 gene (Weterings et al . (1995) Plant J 8(1) =55-63), 
and the like. 
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Recombinases contemplated for use in the practice of 
the present invention include Cre recombinase. FLP 
recombinase, the R gene product of Zygosaccnzronyces 
(Onouchi et al. (1995) Mol Gen Genet 247 (6) : 653 - 660) , and 
5 the like. 

Presently preferred constructs contemplated for use in 
the practice of the present invention include ProCre 
(comprising the protamine 1 gene promoter operatively 
associated with Cre recombinase) , ProFLP (comprising the 
10 protamine 1 gene promoter operatively associated with FLP 
recombinase), ProR (comprising the protamine 1 gene 
promoter operatively associated with the R gene product of 
Zygosaccharomyces) , and the like . 

in accordance with another embodiment of the present 
15 invention, there are provided nucleic acid constructs 
comprising a conditional promoter or a tissue-specif « 

i -^^^ ot-pd with a recombinase coding 
promoter operatively associated wiun 

sequence . 

Promoters contemplated for control of expression of 
20 recombinase nucleic acid(s) employed in accordance with 
this aspect of the present invention include inducible 
(e g., minimal CMV promoter, minimal TK promoter, modified 
MMLV LTR). constitutive (e.g., chicken 3-actin promoter, 

,.rj a j\ nupp) and/or tissue speciuc 
MMLV LTR (non-modif led) , DHr K J , anu/w 

2 5 promoters. 

Conditional promoters contemplated for use in the 
practice of the present invention comprise transcription 
regulatory regions that function maximally to promote 
transcription of mRJJA under inducing conditions. Examples 
30 of suitable inducible promoters include DHA sequences 
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corresponding to: the B. coli lac operator responsive to 
IPTG (see Nakamura et al . . Cell, 18=1109-1117, 1979),- the 
metallothionein promoter metal-regulatory-elements 
responsive to heavy-metal e.g.. zinc) induction (see Evans 
5 et a i U.S. Patent Bo. 4,870,009), the phage T71ac 
promoter responsive to IPTG (see Studier et al *»«.. 
En2 ymol., 1.5. 60-89, 1990,- and U.S. #4,952,496), the heat- 
shock promoter; the TK minimal promoter,- the CMV minimal 
promoter; a synthetic promoter; and the like. 

Exemplary constitutive promoters contemplated for use 
in the practice of the present invention include the CKV 
promoter, the SV40 promoter, the DHFR promoter, the mouse 
mammary tumor virus (MMTV) steroid- inducible promo ter 
Moloney murine leukemia virus (MMLV) promoter, elongation 
15 factor ic. (EFlO promoter, albumin promoter, APO 

promoter, cyclic AMP dependent kinase II ( CaMKI promoter 
keratin promoter, CD3 promoter, immunoglobulin light or 
heav y chain promoters, neurof iliment promoter, neuron 
specific enolase promoter, L7 promoter, CD2 promoter, 
2 0 myosin light chain kinase promoter, HOX gene promoter^ 
thymidine kinase (TK) promoter, KNA Pol I! promoter, MYOD 
promoter, MYF5 promoter. phophoglycerokinase PGK 
promoter, Stfl promoter. Low Density Lipoprotein (LDL) 
promoter, chicken B -actin promoter (used in conjunction 
25 with ecdysone response element) and the like. 

As readily understood by those of skill in the art, 
ch e term "tissue specific refers to the substantially 
exclusive initiation of transcription in the tissue from 
which a particular promoter, which drives expression of a 
30 given gene, is derived (e.g., expressed only in J-c.ll.. 
endothelial cells, smooth muscle cells, and the like). 
Exemplary tissue specific promoters contemplated for use in 
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c he practice of the present invention include the 
promoter, the «S E propter. the GFAP propter, 
neurotransmitter promoters (e.g., tyrosine hydroxylase , TH, 
choline acetyltransf erase, chAT, and the like,, promoters 
5 £ or neurotropic factors (e.g., a nerve growth factor 
promoter, HT-3. BDNF promoters, and the like), and so on. 

In accordance with yet another embodiment of the 
present invention, there are provided embryonic stem ceils 
contains a nucleic acid construct as described here.. 

AS readily understood by these of skill in the art, 
the above-described constructs can be introduced into a 
varie ty of animal species, such as, for example, mouse, 
ra t. rabbits, swine, ruminants «^ ^ 

humans, poultry, fish, and the like 9 also 
l5 amphibians, insects, nematodes, and the Irke, are a so 
contemplated. Members of the plant kingdom, such as, for 
example, transgenic mono- and dicotyledonous spec e ■ 
including important crop plants,, i.e., wheat, rrce, max 
in lfalfa an d the like, are also 

soybean, potato, cotton, alfalfa, and tne 

20 contemplated. 

F or example, pluripotential ES cells can be derived 
from early pre- implantation embryos, preferably the ova are 
harvested between the eight-cell and blastocyst stages. 

25 cells are maintained in culture long enough to perm- 
integration of the pr omoter-recombina S e nucleic .« 
construct ,s,. The cells are then either injected .nto 
host blastocyst, i.e., the blastocoel of the 
blastocyst, or co-cultured with eight-cell to 

30 ova, i.e., zena-free morula, so that 

are preferentially incorporated into the rnner cell mass - 
th e developing embryo. with blastocyst rn 3 ect 1 cr 
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transgenic offspring are termed "chimeric, " as some of 
their cells are derived from the host blastocyst and some 
transfected ES cells. The host embryos are transferred 
into intermediate hosts or surrogate females for continuous 
5 development. 

The transformation procedure for plants usually relies 
on the transfer of a transgene carrying a particular 
promoter construct via the soil bacterium AgrroJbacterium 
tumefaciens. Transformation vectors for this procedure are 

10 derived from the T-DNA of A . tumefaciens, and transgenes 
are stably incorporated into the nuclear genome. The 
activity of the transgenes can then be monitored in the 
regenerated plants under different conditions. In this 
way, many promoter elements that are involved in complex 

15 regulatory pathways such as light responsiveness or tissue 
specificity have been defined. 

Alternatively, direct {i.e., vectorless) gene 
transfer systems are also contemplated including chemical 
methods, electroporat ion, microinjection, biolistics, and 

20 the like. Protoplasts isolated from the plants can bo 
obtained by treatment with cell wall degrading enzymes. 
DNA can be introduced into plant protoplasts by a number of 
physical techniques including electroporation and 
polyethylene glycol treatment in the presence of MgCl : . 

25 The method of choice for rapid promoter analyses in plants 
is the biolistic method. This technique involves the 
delivery of the particular DNA construct into plant cells 
by microprojectiles , i.e., nucleic acid(s) coated or 
precipitated by tungsten or gold. This method is not 

30 limited to any particular plant species or tissue type. 
Preferably, this method would allow quantitative analysis 
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of transformation if appropriate selectable markers are 
included. 

in a preferred embodiment, the genome of embryonic 
stem cells according to the invention comprise a 
5 transcriptionally active selectable marker flanked by two 
recombination target sites. It is especially preferred 
that the recombinase encoded by the recombinase coding 
sequence operatively associated with a germline- specific 
promoter is selective for the recombination target sites 
10 flanking said selectable marker. 

Optionally, embryonic stem cells according to the 
invention may further comprise one or more of: 

a nucleic acid fragment flanked by two recombination 
target sites, wherein said recombination target sites are 
different than the recombination target sites which flank 
said selectable marker, 

a nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence , 

a second nucleic acid construct comprising a tissue- 
specific promoter operatively associated with a second 
recombinase coding sequence, or the like. Preferably, the 
second recombinase coding sequence will be different than 
the first recombinase coding sequence. 

The ability to select and maintain nucleic acid 
constructs in the host cell is an important aspect of an 
expression system. The most common type of selectable 
cnarker incorporated in the nucleic acid construct is an 
antibiotic resistance element allowing selection with 
ampicillin, kanamycin, neomycin, tetracycline, hygromycm, 
puromycin, blastophycin, and the like. Other approaches 



15 



20 



25 



30 
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employ specially constructed host cells which require the 
selectable marker for survival. Such selectable markers 
include the valine tRNA synthetase, val S, the 
single- stranded DNA binding protein, ssb, thymidine kinase, 
5 or the like. Alternatively, naturally occurring partition 
systems that maintain copy number and select against 
plasmid loss is also contemplated. An example is the 
incorporation of the parB locus. Other selectable markers 
include HPRT and the like. 



10 



Selectable markers specific for plants include, the 
gus A (uid A) . the bar gene, phosphinothricin and the lxke. 

-in accordance with still another embodiment of the 
present invention, there are provided methods for excission 
of the transcriptionally active selectable marker from the 
15 above-described embryonic stem cells, said method 
comprising : 

passaging the genome derived from said embryonic stem 
ells through gametogenesis (i.e., spermatogenesis or 



c 

oogenesis) 



20 



Excission of marker as contemplated herein can cause 
a variety of end results, e.g., deletion of the marker or 
a nucleic acid sequence, gain of function or loss of 
function, replacement of function, and the like, as well as 
modulation of any one or more of these results. 

25 Functions which are contemplated to be manipulated 

include regulating body size and growth rate, including 
recombining gene constructs which contain various growth 
hormone gene sequences. Other productivity traits that are 
targets include altering the properties or proportions of 

30 caseins, lactose, or butterfat in milk, increased 
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resistance to viral and bacterial diseases (i.e., 
"constitutive immunity" or germ-line transmission of 
specific, recombined antibody genes) , more efficient wool 
production, and the like. Other functions which are 
5 contemplated to be modulated include development of lines 
of transgenic animals and plants for use in directing 
expression of transgenes encoding biologically active human 
proteins . 

Agronomic traits which are contemplated to be 
10 modulated by use of the present invention include tolerance 
to biotic an abiotic stress, increased resistance to 
herbicides, pest damage, and viral, bacterial, and fungal 
diseases, improvement of crop quality {i.e., increase in 
nutritional value of food and feed) , reduction of post- 
15 harvest losses, improvement of suitability and enlargement 
of the spectrum for processing (i.e., altered quantity and 
composition of endogenous properties, production of new 
compounds of plant or non-plant origin such as biopolymers 
or pharmaceutical substances) - 

20 In accordance with a still further embodiment of the 

present invention, there are provided methods for the 
production of recombinant alleles, said method comprising: 
introducing a nucleic acid fragment flanked by at 
least two recombination target sites into embryonic stem 
25 cells as described herein, and 

passaging the genome derived from said embryonic stem 
cells through gametogenesis . 

As readily recognized by those of skill in the art, 
nucleic acid fragments can be introduced into ES cells by 
30 a variety of techniques, e.g., by homologous recombination, 
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random insertion, retroviral insertion, site specific- 
mediated recombination, and the like. 

Nucleic acid fragments contemplated for use herein 
include fragments containing an essential portion of a gene 
5 of interest . 

in accordance with yet another embodiment of the 
present invention, there are provided methods for the 
production of recombinant alleles, said method comprising: 

introducing at least one recombinase responsive construct 
into embryonic stem cells as described herein, 

wherein said construct (s) comprise (s) a nucleic 
acid fragment and a selectable marker, 

wherein said selectable marker is flanked by a 
first pair of recombination target sites, and 

wherein said nucleic acid fragment is flanked by 
a second pair of recombination target sites. 



\ 



10 



15 



passaging the genome derived from said embryonic stem cells 
through gametogenesis . 

in a presently preferred aspect, the first pair of 
,0 recombination target sites is recognized by a recombinase 
which is expressed under the control of a germline- specif ic 
promoter and said second pair of recombination target sites 
is recognized by a recombinase which is expressed under the 
control of a conditional promoter or a tissue specific 
2 5 promoter . 

Optionally, the embryonic stem cells employed herein 
can further comprise a second nucleic acid construct 
selected from constructs comprising a conditional promoter 
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operatively associated with a recombinase coding sequence, 
a construct comprising a tissue-specific promoter 
operatively associated with a recombinase coding sequence, 
and the like. 

5 m accordance with still another embodiment of the 

present invention, there are provided methods for the 
conditional assembly of functional gene(s) for expression 
in eukaryotic cells by recombination of individual inactive 
gene segments from one or more gene(s) of interest, 
10 wherein each of said segments contains at least one 

recombination target site, and 

wherein at least one of said segments contains at 
least two recombination target sites, 

said method comprising: 

15 introducing said individual inactive gene 

segments into an embryonic stem cell as described 
herein, thereby providing a DNA which encodes a 
functional gene of interest, the expression product of 
which is biologically active, upon passage of the 

20 genome derived from said stem cells through 

gametogenesis . 

For assembly of functional genes from inactive gene 
segments, see, for example, US Patent No. 5,654,182, 
incorporated herein by reference in its entirety. 



25 



30 



in accordance with a still further embodiment of the 
present invention, there are provided methods for the 
generation of recombinant livestock, said method 
comprising: 

combining embryonic stem cells that include nucleic 
acid construct according to the invention with host 
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pluripotential ES cells derived from early pre-implantation 
embryos, and 

introducing these combined embryos into a host female 
and allowing the derived embryos to come to term. 

5 

In accordance with yet another embodiment of the 
present invention, there are provided methods for the 
generation of recombinant plants, said method comprising 
transforming plant zygotes with nucleic acid constructs 
10 according to the invention and allowing the zygote to 
develop . 

The objective of the current work with ProCre nucleic 
acid constructs was to determine the potential of 
germline-specif ic promoters to implement efficient 

15 approaches utilizing site-specific recombinases to generate 
an array of sophisticated mutations in mammals and plants. 
The data shows that it is possible to create recombinase 
nucleic acid constructs that are expressed at high levels 
in the germ line but not to a functionally significant 

20 extent in either ES cells or embryonic or adult somatic 
tissues. Homologous recombinants with a selectable marker 
can be isolated in ES cells that contain 
promoter- recombinase nucleic acid constructs. Transgenic 
animals and plants bearing the promoter -recombinase nucleic 

2 5 acid constructs and a target allele transmit the recombined 

target to their progeny at high frequencies. These results 
establish the principle that mammals and plants containing 
loci that have been homologously recombined and then 
subsequently site-specifically recombined can be generated 

3 0 simply by using ES cells with a suitable recombinase 

nucleic acid constructs for the initial targeting. By this 
mechanism, alleles containing a single recombinase target 
site and a mutation of interest can be produced in the 
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progeny of ES cell chimeras without any investment of time, 
expertise, or resources over that required to create an 
allele that still contains a selectable marker. The 
paradigm has obvious utility in the production of subtle 
and conditional mutations that require generation of 
alleles with minimal structural alterations. Because the 
presence and transcriptional activity of selectable markers 
can contribute to phenotypes in an unanticipated and 
unwanted manner (Fiering et al . (1995) Genes Dev 
9:2203-2213); Olson et al . (1996) Cell 85:1-4), the 
approach will also useful for generating null alleles. 



Expression of the endogenous mPl locus (Hecht et al . 
(1986) Exp Cell Res 164:183-190), and mPl-driven nucleic 
acid constructs (Behringer et al . (1988) Proc Natl Acad Sci 

15 USA 85:2648-2652; Braun et al . (1989) Nature 337:373-376; 
Zambrowicz et al . (1993) Proc Natl Acad Sci USA 
90:5071-5075) is restricted to haploid spermatids. 
Expression of mPl nucleic acid construct expression 
typically begins at haploid stages, and both RNA (Caldwell 

20 and Handel (1991) Proc Natl Acad Sci USA 88:2407-2411) 
and proteins (Braun et al . (1989) Nature 337:373-376) 
diffuse through the spermatogenic syncytium. The result is 
a highly efficient recombination of target alleles and the 
segregation of recombinase and target nucleic acid 

2 5 constructs in the first generation. 

Cre-mediated recombination proved to be highly testis- 
specific in ProCre mice. It is clear that the nucleic acid 
constructs are not expressed in the inner cell mass or in 
other early embryonic tissues. Cells from pre-implantation 

3 0 embryos intermingle extensively and the embryo as a whole 

is derived from a small number of cells (Beddington et al . 
(1989) Development 106:37-46; Soriano and Jaenisch (1986) 
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Cell 46:19-29). If ProCre nucleic acid constructs 

recombined target sequences during pre-implantation stages, 
at least a few percent of the cells in many tissues would 
contain the P2Br allele and Southern and PCR analyses 
5 showed that this was not the case. The ectopic Cre 
activity seen in some ProCre strains probably resulted from 
low levels of recombinase expression in later embryos or 
mature tissues, a finding consistent with the expression 
patterns of other mPl -driven nucleic acid constructs. 

10 Northern analyses have failed to reveal the expression of 
mPl -containing nucleic acid constructs in a variety of 
mature tissues (Peschon et al . (1987) Proc Natl Acad Sci 
USA 84:5316-5319; Behringer et al . (1988) Proc JVatl Acad 
Sci USA 85:2648-2652; Peschon et al . (1989) Ann N Y Acad 

15 Sci 564:186-197; Zambrowicz et al . (1993) Proc Natl Acad 
Sci USA 90:5071-5075), but nucleic acid constructs 
containing the mPl promoter and the SV40 T-antigen led to 
the consistent development of tumors of the petrosal bone 
and right cardiac atrium (Behringer et al . (1988) Proc Natl 

20 Acad Sci USA 85:2648-2652). 

PCR assays represent a very sensitive assay for 
whether sufficient levels of Cre protein were produced to 
effect recombination. Importantly, they measured the 
cumulative level of recombination, for events that occurred 

25 at any stage of development are likely to have been 
propagated to, and might be amplified in, descendant 
populations. The highest level of ectopic recombination 
was that observed in cardiac ventricular tissue of 
strain which generated a signal approximately equivalent to 

30 that expected if the ratio between recombined and 
unrecombined alleles were 1:104. The activities observed 
in other strains were considerably lower than this, and one 
strain did not show any ectopic activity. None of the 
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strains showed evidence of recombination in the cardiac 
atria and the petrosal bone was not examined. These assays 
did not rule out the possibility that higher levels of 
recombination occur in tissues that were not examined or 
5 that the . low levels of recombination observed in some 
tissues reflected high levels of recombination in some 
component cell population. 

These low levels of ectopic' activity suggest that 
mpl -driven recombinase nucleic acid constructs could be 

10 used for the production of embryos containing genetically 
lethal alleles. Some alleles created by homologous 
recombination in ES cells will prove to be lethal in 
heterozygotes, as was the case for an mRNA editing mutation 
of the GluR2 glutamate receptor subunit (Brusa et al . 

15 (1995) Science 270:1677-1680) . Germline transmission would 
be restricted to rare chimeras in which the level of 
chimerism was low enough in tissues affected by the 
mutation to allow survival and high enough in the germline 
to allow transmission. This problem could be circumvented 

20 by creating recombinase -conditional mutations in ES cells 
bearing mpl -recombinase nucleic acid constructs, or by 
making the same mutations in standard ES cells and then 
introducing the- mpl -recombinase nucleic acid construct by 
breeding. So long as the recombined version of the allele 

2 5 did not adversely impact terminal stages of 

spermatogenesis, embryos containing the recombined allele 
could be efficiently produced. Embryos containing 

recombined nucleic acid constructs can also be produced 
through the activity of Cre nucleic acid constructs that 

3 0 are expressed during early embryogenesis from the human 

cytomegalovirus minimal promoter (Schwenk et al . (1995) 
Nucleic Acids Res 23:5080-5081), the adenovirus Ella 
promoter (Lakso et al . (1992) Proc Natl Acad Sci USA 
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89 . 6232-6236). or the zP3 promoter (Lewandoski et al. 
„ 997) Curr Biol 7=148-151). OroCre and 2 P3 nucleic acid 
constructs have the advantage of delivering a ««-b^ 
allele to the zygote, guaranteeing that all cells in the 
S derived embryos will contain the allele. 

procre ES cells are but one o£ many different kinds of 
recombinase-bearing ES cells that could significantly 
snorten the time and effort reauired for a wide variety of 
genetic manipulations in mice. The most obvious of these 
L0 are complementary ProP,P ES cell, 

recombinase was derived from s. cerevisae (Broach and H-«. 
(1980 ) Cell 21:501-508) or another species (Kuhn et a.. 
(1 995) science 269:1427-1429). Conceptually distinct from 
hese but perhaps as generally useful would be ES cells 
1S bearing inducible recombinase nucleic acid constructs that 
would facilitate temporal control of recombinase expression 
ln ES cells, chimeras, and their progeny to 
sit e-specifically recombined alleles (Araki et al. (1992) 
, Mol Biol 225.25-37,- No et al . (1996) -oc *atl Acad sc. 
20 USA 93 -.3346-3351.- Logie and Stewart ,1995, Proc Natl Acad 
20 USA (1 996) Proc Natl Acad 

Sci USA 92:5940-5944; Fell et al . {193b) 

Sci USA 93:10887-10890) . Finally, fusion genes that led 
to recombinase expression in specific tissues could be used 
to address specific research objectives. 

25 The invention will now be described in greater detail 

by reference to the following non-limiting examples. 



Pvample 1 
Mammalian ™™ instructs 



30 



A 652 bp fragment of the mPl promoter (SEQ ID NO : - ; 
Peschon et al . (1989) Annals of the Nev Yor* Academy of 
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Sciences 186-197) was isolated by PGR using PGR primers 
(SEQ id NOs:2 and 3) and genomic DNA templates from CCE ES 
cells (Robertson et al. (1986) Nature 323 : 445-448) . Thxs 
fragment was fused to a modified Cre coding sequence (SEQ 
s ID NO- 4) which contains a consensus translation start site 
(Kozak (1986) Cell 44:283-292) , 11 codons for a human c-myc 
epitope (Evan et al . (1985) Mol Cell Biol 5:3610-3616), 
" 7 codons for a minimal SV40 nuclear localization sxgnal 
(Kalderon et al . (1984) Cell 39:499-509) and the 
10 polyadenylation signal from pIG-Gre in the plasmid P OG304M 
(SEQ ID NO:5). The Cre expression plasmid pOG231 was 
prepared by fusing a modified Cre coding sequence- from 
pI C-Gre (Gu et al . (1993) Cell 73:1155-1164), and 
containing the same translation start and nuclear 
15 localization signal, to the synthetic intron and CMV 
promoter of P 0G44 (O'Gorman et al . (1991) Science 
251:1351-1355) . 

A plasmid, POG277 (SEQ ID NO:7), containing a 
loxP-flanked neomycin cassette was prepared by inserting a 
20 wild-type loxP site (SEQ ID NO : 8 ; Hoess et al . (1982) Proc 
Natl Acad Sci USA 79:3398-402) into pBSKS (Stratagene) 
and then cloning the neomycin expression cassette from 

,i Cell 51:503-512) 

pMClneo-polyA (Thomas et al . (1987) 

between interactions of this loxP site. The hoxb-l 
25 targeting construct consisted of the PGK-TK cassette from 
pPNT (Tybulewicz et al . (1991) Cell 65:1153-63), and l.4kb 
and 10.2Kb of sequences 5- and 3 • to an Nru I site 800 bp 
5 . to the hoxb-l transcriptional start site isolated from 
a 129 strain genomic library (Stratagene). The 
30 loxP-flanked neo cassette from pOG277 was inserted into the 
Nrul site. The P OG277 neomycin cassette and a (3 -GAL 
sequence was inserted into the first exon of the large 
subunit of RNA polymerase II (RP2) (Ahearn et al . (1987) 
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J. Biol. Chem. 262:10695-10705) to create the P2Bc allele 
(Figure 1) . Cre-mediated recombination of the P2Bc allele 
results in the deletion of the neomycin cassette (Neo) of 
P2Bc that is flanked by two loxP sites, leaving a single 
loxP site and fusing the B-Gal coding sequence to the 
initial codons of the RNA polymerase II coding sequence. 
Recombination increases the size of a Pst I fragment 
recognized by the RP2 probe, which is external to the 
targeting vector used, indicated by the shaded box below 
each allele. 



Bvample 2 
Production of t ransgenic mice 

Fertilized oocytes obtained from matings of 129/SvJae 
(Simpson et al . (1997) Nat Genet 16:19-27) and BALB/c X 
15 C57BL/6 Fl mice were used for pronuclear injections of the 
Protamine-Cre fusion gene from pOG304M according to 
standard protocols (Hogan et al . Manipulating the Mouse 
Embryo: The Manual, Coldspring Harbor Press (1994), pg. 
497) . Production of ES cells and homologous recombinants: 
20 Heterozygous ProCre 129/SvJae males were mated to 
129/ S vEms - + Ter? / J females (Simpson et al . (1997) Nat Genet 
16:19-27) to produce blastocysts that were cultured 
according to standard protocols (Robertson (1987) 
Teratocarcinomas and embryonic stem cells, a practical 
25 approach, eds . E. J. Robertson (IRL Press), pp. 71-112). 
The sex (King et al . (1994) Genomics 24:159-68) and ProCre 
status of each line were determined by PCR assays. 
Molecular analyses: Tail biopsy genomic DNA was used for 
hybridization assays or PCR assays to identify ProCre and 
30 P2Bc/r mice. PCR reactions for the detection of ectopic 
Cre activity used 100 ng of genomic DNA as a template to 
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amplify a P2Br-specif ic product using 'a 5- primer from the 
R P2 promoter and a 3 • primer from the P -GAL coding sequence 
(Figure 1) . Thirty cycles of amplification were done in a 
total volume of 100 fil using 300 ng of each primer, 3 mM 
MgCl2, 1-5 units of Taq polymerase, and an annealing 
temperature of 60°C. Southern blots of reaction products 
were hybridized with a probe specific for the P2Br reaction 
product . 

Rxample 3 

P ^rr* yrnrrlfti.e A c ^ rnnfltrurfs Efficiently Pecombine 

Tar gp.r Alleles 

A total of nine founder animals with ProCre nucleic 
acid constructs were obtained from injections of a 
Protamine-Cre fusion gene. Two lines were derived from 
injections of !29SvJae (Simpson et al . (1997) Nat Genet 
16:19-27) embryos, and seven . from injections of CB6F2 
embryos. The 129/SvJae lines and three randomly selected 
hybrid lines were examined in detail. To determine whether 
ProCre nucleic acid constructs would efficiently recombine 
a target allele, males were generated that contained a 
ProCre nucleic acid construct and a target for Cre-mediated 
recombination. This »P2Bc» (Pol XI, J5-GAL, conditional) 
target (Figure 1) was created using homologous 
recombination in ES cells to insert a loxP- flanked neomycin 
cassette and a (J - GAL coding sequence into the first exon of 
the locus coding for the large subunit of RNA 
polymerase II. Cre-mediated recombination of the loxP 
sites was expected to delete the intercalated sequences, 
creating »P2Br" allele (£ol IX, fi-Gal, recombined) . 



These males were mated to wild-type females and the 
resulting progeny were examined by Southern blotting to 
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determine if they inherited the P2Bc or the P2Br allele, 
and to additionally determine the segregation pattern of 
ProCre nucleic acid constructs and P2Br alleles. Southern 
blot of Pst I digested tail biopsy DNA's from a +/P2Bc, 
5 + /ProCre male (sire) and four of his progeny by a wild-type 
female probed with n RP2 probe (top) and then reprobed with 
a Cre probe (bottom) . The large majority of transmitted 
target alleles were Cre -recombined P2Br alleles (Table 1) . 
ProCre nucleic acid constructs and recombined target 
10 alleles segregated independently in the first generation; 
approximately 50% of mice that inherited a P2Br allele also 
inherited their male parent's ProCre nucleic acid 
construct. All RP2 mutant alleles in the progeny were 
P2Br, and some progeny inherit a P2Br allele without 
15 inheriting ProCre nucleic acid construct. Mouse 4 did not 
contain a ProCre nucleic acid construct and is homozygous 
wild-type at the RP2 locus. These data establish that 
ProCre nucleic acid constructs efficiently recombine the 
P2BC allele in the male germline and that the recombined 
20 P2Br alleles and ProCre nucleic acid constructs segregate 
in the first generation. Because significantly more than 
25% of the- progeny inherited recombined target alleles,, 
recombination either occurred during diploid stages of 
spermatogenesis or Cre generated during haploid stages of 
25 spermatogenesis was distributed among spermatids through 
cytoplasmic bridges (Braun et al . (1989) Nature 
337:373-376), effecting recombination in spermatids that 
did not themselves contain a ProCre nucleic acid construct . 

The progeny of matings between ProCre males and +/P2BC 
30 females were also examined to determine if male gametes 
from ProCre mice delivered enough Cre to zygotes to effect 
Cre-mediated recombination of a target sequence. Of 96 
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progeny examined by Southern blotting, none contained a 
Cre-recombined P2Br allele. 

It has also been discovered that a loxP- flanked neo 
cassette in the glutamate receptor R6 subunit locus is 
5 efficiently recorabined by ProCre nucleic acid constructs in 
mice . 

Rxample 4 

PrnCre Nucleic acid construct Exp ression is Highly 

Tissue -Specific 



10 



Genomic DNAs from ten different tissues of five- to 
seven-week old males that contained both a ProCre nucleic 
acid construct and a P2Bc target allele were analyzed in 
Southern blots. Southern blots were prepared of Pst I 
digested DNA from testes (T) and one other tissue (K f 
15 kidney; B, brain; S, spleen) of males heterozygous for one 
of four ProCre nucleic acid constructs and the P2Bc allele. 
Testis DNA from each male shows a P2Br allele signal, in 
addition to those generated by the wild-type RP2 (WT) and 
P2Bc alleles. Other tissues show only the WT and P2Bc 
20 signals. Only the testis samples showed signal indicating 
Cre-mediated recombination of the target. The intensity of 
the P2Br signal relative to that of the wild-type allele 
ranged from 10% to 22% for different ProCre strains and did 
not correlate with the ProCre nucleic acid construct copy 
2 5 number. The copy number of ProCre nucleic acid constructs 
varied among lines showing similar levels of .recombination 
in testis. For example, restriction patterns and 

densitometric analyses showed that line 58 contained a 
single copy of the ProCre nucleic acid construct, yet 
30 showed virtually the same testis recombination signal as 
line containing more than 100 copies. This variability is 
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similar to results obtained with other mPl promoter -driven 
nucleic acid constructs (Peschon et al . (1987) Proc Natl 
Acad Sci USA 84:5316-5319; Zambrowicz et al . (1993) Proc 
Natl Acad Sci USA 90:5071-5075). 

5 A s a more sensitive measure of ectopic recombination, 

PGR amplifications were performed on the same samples. The 
amplification primers were expected to produce a 325 bp 
product from the recombined target and a 1.4 kb fragment 
from the unrecombined allele (Figure 1) . The assay was 
10 expected to measure the cumulative level of recombination, 
for any P2Br alleles formed during transient expression of 
Cre during development would be preserved and perhaps 
amplified in descendant cells. Low levels of ectopic 
recombination product were observed in some tissues of all 
15 ProCre lines except for one. A southern blot of PGR 
amplification products of the P2Br allele utilized tissues 
from a male heterozygous for the ProCre nucleic add 
construct and the P2Bc allele. DNA from 10 different 
tissues was amplified using primers and conditions that 
20 produced a 350 bp product from the recombined, P2Br allele. 
Each lane contains 10% of the reactions, except for . the 
testis reactions, which were diluted 500 (T5) , 250 (T2) , 
and 100 (Tl) fold prior to loading, and a liver 
reconstruction control (C) . which was diluted 1:100 before 
25 loading. The highest level of ectopic activity was 

observed in cardiac ventricular muscle of mice; in these 
samples the ectopic signal was more than 100 fold lower 
than that observed in testis. Three strains showed much 
lower levels of recombination in brain tissue, and 
30 strain 75 additionally showed ectopic activity in spleen. 
Despite the difficulty of quantifying PCR results, these 
data clearly indicate that ectopic activity occurred at 
very low levels in most tissues of most ProCre lines. 
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Example _ 5 

Isolation of Homologously Recombined ProCre ES Cell 
Clones Using Targeting Vectors with a loxP-Flanked 

Selectable Marker 

Four male +/ProCre ES cell lines were established from 
129/Sv strain ProCre transgenic mice. In preliminary 
experiments, passage 5 cells from one of these lines (PC3) 
were used to generate three male chimeras with between 50 
and 95% coat color chimerism. In matings with C57BL/6 
females, two of these male chimeras have sired a total of 
11 pups, all bearing the Agouti coat color signifying 
germline transmission of the ES cell genome, and 6 of 9 
pups genotyped additionally contained the line 70 ProCre 
nucleic acid construct. The frequency of germline 
transmission has not yet been determined, nor has it been 
determined whether competency for germline transmission 
will persist in homologously recombined ProCre ES cells at 
later passages. 

To determine if homologously recombined ProCre ES cell 
20 clones could be isolated using targeting vectors that 
contained a loxP-flanked selectable marker, two 
transf ections were done using variants of a targeting 
vector in which a loxP- flanked neomycin cassette was 
inserted into an Nru I site in the hoxb-1 locus promoter 
2 5 (Figure 2) . A Southern blot of BamHI -digested genomic DNAs 
were harvested from a 96-well plate from 10 doubly-selected 
ES cell clones and hybridized with a probe (shown in Figure 
2) which is external to the targeting construct. All 
samples show the 7.5 kb band from the wild-type allele and 
30 four clones additionally show the 6 kb band predicted to 
result from homologous recombination. In these 
transf ections, 12 of 62 (19%) PC3- and 10 of 56 (18%) 
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PCs-derived clones that were ganciclovir and G418 -resistant 
(Mansour et al . (1988) Nature 336:348-352) were found to be 
homologously recombined. In two parallel transf ections of 
CCE cells (Robertson et al . (1986) Nature 323:) with the 
5 same vectors, 3 2 of 93 (34%) and 15 of 132 (11%) clones 
were homologously recombined. The total numbers of 
G418 -resistant clones recovered from ProCre ES cell 
transfections were reduced relative to the parallel CCE 
transf ections. This may be attributable to both 

10 Cre -mediated excision of the neomycin cassette and to the 
fact that the transfections were done under electroporat ion 
conditions optimized for CCE cells. 

Because it was formally possible that the homologously 
recombined clones contained inactive loxP sites, five 
15 homologously recombined PC3 ES cell clones and the parental 
PC3 cell line using the primers shown in Figure 2 were 
either mock transfected or transiently transfected with the 
pOG231 Cre ' expression vector. For the transient 

transfection assay, DNA was harvested 48 hours after 
20 transfection and used in PCR assays to assess whether the 
loxP sites in the recombinant clones could be recombined by 
Cre. In all cases a clear recombination signal was • 
observed in the pOG231 transfected sample. The recombinant 
clones and parental cell lines show the 204 bp 
25 amplification product of the wild-type allele, and the 
recombinant clones additionally show a 1600 bp product 
(1600) resulting from amplification across the neomycin 
cassette and a nonspecific 1100 bp amplification product 
(NS) . The pOG231-transfected recombinant clones show an 
30 additional 268 bp product signaling the Cre-mediated 
excision of the neomycin cassette from the recombinant 
alleles of some cells. Experiments were also done to 
assess the stability of the loxP-flanked neo cassette in 
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ProCre ES cells. Five recombinant clones were grown in the 
presence of G418 for two weeks, and then aliquots of each 
were grown either in the presence or absence of G418 for a 
further 10 days. PCR assays were performed to determine if 
5 cre-recombined alleles were present in any of these samples 
and none was observed in the mock transfected controls. 
These data suggest that there is not enough Cre activity to 
significantly influence either the ability to isolate 
recombinant clones or the stability of the selectable 
10 markers in those clones, establishing that the loxP sites 
in these clones were functional. 

To determine if there was any detectable Cre activity 
in ProCre ES cells, aliquots of two lines (PC3 and PCS! 
were transiently transfected with the targeting vector used 
15 to create the P2Bc allele. DNA was recovered 48 hours 
after transfection and used for PCR amplifications of the 
P2Br plasmid molecules that would be generated by 
extrachromosomal Cre-mediated recombination. Small amounts 
of recombination product were seen in both ProCre ES cell 
20 transfections, and none was observed in parallel samples of 
CCE ES cells. This shows that the ProCre ES cell lines 
express sufficient Cre to recombine some extrachromosomal 
targets when the latter are present at high copy numbers. 
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Fvample 6 
plant- nwA Co nstructs 

To define sequences in the LAT52 and LAT59 promoters 
involved in expression in pollen, proximal promoters were 
5 constructed employing a series of linker substitution 
mutants using the particle bombardment system (Klein et al . 
(1987) Nature 327:70-73; Twell et al . (1989b) Plant Physiol 
91-1270-1274). These experiments were performed by co- 
bombarding the test plasmids (luciferase [LUC] recombinase 
10 fusions) with reference plasmids (S-glucuronidase [GUS] 
fusions) . The latter served as a control for bombardment 
variability and allowed ' comparisons to be made between 
independent bombardments. 

The context of the -100 promoter in LAT52 and the -115 
15 promoter in LAT59 was chosen because these promoters 
appeared to be the minimal regions that still conferred 
high levels' (25% relative to the available full-length 
promoter) of pollen- specif ic expression (Twell et al . 
(1991) Gen Dev 5:496-507). These minimal promoters were 
then fused to the Cre coding sequence operatively linked to 
the luc gene (Ow et al . (1986) Science 234:856-858) coding 
region, and the resulting plasmids served as a basis for 
creating the nucleic acid constructs. The LAT52 linker 
substitutions were performed in P 52LUC, which contain 
25 entire LAT5 2 5' untranslated region (5' UTR) . A series of 
six 9- to 10-bp-long linker substitutions were made m 
P 52LUC, spanning the region -84 to -29 (52LS1 to 52LS6) . 
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Ryample 7 
Tissue Specificit y in Plants 

The results obtained by transient expression in pollen 
and in transgenic plants provided information on the effect 
of the various constructs on expression in pollen but not 
on their effect on tissue specificity. A tobacco cell 
culture, TXD (maintained as described by Howard et al . 
(1992) Cell 68:109-118), was, therefore, added as an 
additional component of the transient assay system. The 
TXD cell culture was initiated from tobacco mesophyll cells 
and therefore represents somatic tissue, as opposed to the 
gametophytic tissue represented by pollen. Cells in 
culture were chosen, rather than intact tissue, as the 
somatic tissue source because such cells superficially 
15 resemble pollen in that they can be spread out as a 
monolayer on a plate before bombardment. 

in this experiment, translation fusions between the 
luc coding region and either the CaMV 35S promoter drove 
strong expression in cell culture but negligible expression 
in pollen, whereas the LAT52 promoter showed the opposite 
pattern of strong activity in pollen and negligible 
activity in cell culture. Thus, the transient assay system 
mimics the expression pattern observed for these promoters 
in transgenic plants (Twell et al . (1991) Genes Dev 5:496- 
507) . This differential expression provided us with a tool 
with which to address tissue specificity. 



20 



25 



Rxample 8 

Pla ^ Transformers and A nalysis of Transgenic Plants 

Constructs cloned into P Binl9 were introduced into 
30 tomato [Lycopersicon esculentum cv VF36) by Agrobacterium 
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tumefaciens LBA4404 as previously described (McCormick 
(1991b) Transformation of tomato with Agrobacterium 
tumefaciens, In Plant Tissue Culture Manual, K. Linsey, Ed 
B6:l-9). At least 20 independent transf ormants were 
5 obtained for each construct. 

For S-glucuronidase (GUS) assays, 5 to 20 fih of 
pollen, pooled from several flowers of the same plant, was 
ground directly in Eppendorf tubes in 50 to 100 of GUS 
extraction buffer (Jefferson et al . (1987) EMBO 6:3901- 

10 3907) using a Teflon- tipped homogenizer driven by a drill. 
Expression in pollen was measured by f luorometrically 
assaying GUS activity in supernatants of pollen extracts 
using 2mM 4 -methylumbellif eryl S-D-glucuronide (Sigma) as 
substrate (Jefferson et al.(1987) EMBO 6:3901-3907). GUS 

15 activity was corrected for variation in total protein 
content using a bicinchoninic acid protein assay kit 
(Pierce, Rockford, ID . 

Expression in leaves, flowers, stems, roots, and seed 
was tested histochemically by staining with 5-bromo-4- 
20 chloro-3-indolyl S-D-glucuronide (Molecular Probes, Eugene, 
OR) as described previously (Jefferson et al.(1987) EMBO 
6:3901-3907). Expression in leaves was also analyzed 
f luorometrically as given previously. 



Example 9 

2 5 Transient Transformation of Tobacco Pollen 

and Cell Culture 

Pollen spread out as a monolayer was bombarded 
essentially as previously described (Twell et al . (1991) 
Genes Dev 5:496-507) , except that gold was substituted for 
30 tungsten and only 1 /xg of test plasmid and used per plate. 
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TXD cell culture (maintained as described by Howard et al . 
(1992) Cell 68:109-118) was spread out similarly as a 
monolayer (1 mL of a 50-mL stationary culture per plate) 
and bombarded as previously described. Between six and 12 
5 independent bombardments were performed for each construct. 
In each experiment, the test plasmid was co-bombarded with 
a reference plasmid: pB1223 (Clontech, Palo Alto, CA) was 
used for assays of all constructs in tobacco cell culture; 
pLAT59-12 (Twell et al . (1990) Development 109:705-713) for 

10 assays of LAT52 and LAT56 constructs in tobacco pollen; 
pLAT56-12 (Twell et al . (1990) Development 109:705-713) for 
assays of LAT59 constructs in tobacco pollen. Processing 
of the tissue after - 15 to 17 hr and analysis of GUS and 
LUC activity- were as described previously (Twell et al . 

15 (1991) Genes Dev 5:496-507). Transient expression was 
reported as "relative LUC activity, " which represents the 
ratio between the test (LUC) and the reference (GUS) 
plasmids . 

While the invention has been described in detail with 
20 reference to certain preferred embodiments thereof, it will 
be understood that modifications and variations are within 
the spirit and scope of . that which is described and 
claimed. 
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That which is claimed is: 

1. A nucleic acid construct comprising a germline- 
specific promoter operatively associated with a recombinase 
coding sequence . 

2. A nucleic acid construct according to claim 1 
wherein said germline-specif ic promoter is the protamine 1 
gene promoter, the protamine 2 gene promoter, the 
spermatid- specific promoter from the c-kit gene, the sperm- 
specific promoter from angiotensin-convert ing enzyme, 
oocyte specific promoter from the ZP1 gene, oocyte specific 
promoter from the ZP2 gene, or oocyte specific promoter 
from the ZP3 gene. 

3. A nucleic acid construct according to claim 1 
wherein said. germline-specif ic promoter is the LAT52 gene 
promoter from tomato, the LAT56 gene promoter from tomato, 
the LAT59 gene promoter from tomato, the pollen-specific 
promoter of-the Brassica S locus glycoprotein gene, or the 
pollen-specific promoter of the NTP303.gene. 

4. A nucleic acid construct according to claim 1 
wherein said recombinase coding sequence encodes Cre 
recombinase . 

5. A nucleic acid construct according to claim 4 
wherein said construct is ProCre, comprising the protamine 
1 gene promoter operatively associated with Cre 
recombinase . 
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6. A nucleic acid construct according to claim 1 
wherein said recombinase coding sequence encodes FLP 
recombinase . 

7. A nucleic acid construct according to claim 6 
wherein said construct is ProFLP, comprising the protamine 
1 gene promoter operatively associated with FLP 
recombinase . 

8. A nucleic acid construct according to claim 1 
wherein said recombinase coding sequence encodes the R gene 
product of Zygosaccharomyces. 

9. A nucleic acid construct according to claim 8 
wherein said construct is ProR, comprising the protamine 1 
gene promoter operatively associated with the R gene 
product of Zygosaccharomyces. 

10. A nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence . 

11. A nucleic acid construct comprising a tissue- 
specific promoter operatively associated with a recombinase 
coding sequence . 

12. Embryonic .stem cells containing a nucleic acid 
construct according to claim 1 . 

13. Embryonic stem cells according to claim 12 
wherein the genome thereof comprises a transcriptionally 
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active selectable marker flanked by two recombination 
target sites. 

14. Embryonic stem cells according to claim 13 
wherein the recombinase encoded by the recombinase coding 
sequence operatively associated with a germline-specif ic 
promoter is selective for the recombination target sites 
flanking said selectable marker. 

15. Embryonic stem cells according to claim 13 
further comprising one or more of: 

a nucleic acid fragment flanked by two recombination 
target sites, wherein said recombination target sites are 
different than the recombination target sites which flank 

said selectable marker, 

a nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence, or 

a nucleic acid construct comprising a tissue-specific 
promoter operatively associated with a recombinase coding 
sequence . 

16. Embryonic stem cells containing a nucleic acid 
construct according to claim 2 . 

17. Embryonic stem cells containing a nucleic acid 
construct according to claim 3 . 

18. Embryonic stem cells containing a nucleic acid 
construct according to claim 4. 
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19. Embryonic stem cells containing a nucleic acid 
construct according to claim 5. 

20. Embryonic stem cells containing a nucleic acid 
construct according to claim 6 . 

21. Embryonic stem cells containing a nucleic acid 
construct according to claim 7 . 

22. Embryonic stem cells containing a nucleic acid 
construct according to claim 8 . 

23. Embryonic stem cells containing a nucleic acid 
construct, according to claim 9 . 

24. Embryonic stem cells containing a nucleic acid 
construct according to claim 10. 

25. Embryonic stem cells according to claim 24 
wherein the genome thereof comprises a transcriptionally 
active selectable marker flanked by two recombination 
target sites . 

26. Embryonic stem cells containing a nucleic acid 
construct according to claim 11. 

27. Embryonic stem cells according to claim 26 
wherein the genome thereof comprises a transcriptionally 
active selectable marker flanked by two recombination 
target sites. 
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28. A method for excission of the transcriptionally 
active selectable marker from the embryonic stem cells of 
claim 13, said method comprising: 

passaging the genome derived from said embryonic stem 
cells through gametogenesis . 

29. A method according to claim 28 wherein said 
genome is passaged through spermatogenesis. 

30. A method according to claim 28 wherein said 
genome is passaged through oogenesis. 

31. A method according to claim 28 wherein said 
embryonic stem cells further comprise one or more of: 

a nucleic acid fragment flanked by two recombination 
target sites, wherein said recombination target sites are 
different than the recombination targec sites which flank 

said selectable marker, 

a nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence, or 

a nucleic acid construct comprising a tissue-specific 
promoter operatively associated with a recombinase coding 
sequence . 

32. A method for the production of recombinant 
alleles, said method comprising: 

introducing a nucleic acid fragment flanked by at 
lea st two recombination target sites into embryonic stem 
cells according to claim 10, and 
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passaging the genome derived from said embryonic stem 
cells through gametogenesis . 

33. A method according to claim 32 wherein said 
nucleic acid fragment comprises an essential portion of a 
gene of interest . 

34. A method according to claim 32 wherein said 
nucleic acid fragment is introduced by homologous 
recombination, random insertion, retroviral insertion, or 
site specific-mediated recombination. 

35. A method for the production of recombinant 
alleles, said method comprising: 

introducing a nucleic acid fragment flanked by at 
least two recombination target sites into embryonic stem 
cells according to claim 13, and 

passaging the genome derived, from said embryonic stem 
cells through gametogenesis. 

36. A method according to claim 35 wherein said 
embryonic stem cells further comprise a second nucleic acid 
construct selected from the group consisting of a construct 
comprising a conditional promoter operatively associated 
with a recombinase coding sequence and a construct 
comprising a tissue- specific promoter operatively 
associated with a recombinase coding sequence. 



37. A method according to claim 36 wherein the 
recombinase encoded by said second construct is expressed 
in response to inducing conditions. 



WO 99/10488 



40 



PCT7US98/17852 



38. A method according to claim 36 wherein the 
recombinase encoded by said second construct is expressed 
in a tissue selective manner. 

39. A method according to claim 35 wherein the 
recombination target sites flanking said nucleic acid 
fragment are recognized by a recombinase which is expressed 
under the control of a conditional promoter or a tissue 
specific promoter. 

40. A method for the production of recombinant 
alleles, said method comprising: 

introducing at least one recombinase responsive construct 
into embryonic stem cells according to claim 10, 

wherein said construct (s) comprise (s) a nucleic 
acid fragment and a selectable marker, 

wherein said selectable marker is flanked by a 
first pair of recombination target sites, and 

wherein said nucleic acid fragment is flanked by 
a second pair of recombination target sites, 

passaging the genome derived from said embryonic stem cells 
through gametogenesis . 

41. A method according to claim 40 wherein said first 
pair of recombination target sites is recognized by a 
recombinase which is expressed under the control of a 
germline-specific promoter and said second pair o 
recombination target sites is recognized by a recombinas. 
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which is expressed under the control- of a conditional 
promoter or a tissue specific promoter. 

42. A. method according to claim 40 wherein said 
embryonic stem cells further comprise a second nucleic acid 
construct selected from the group consisting of a construct 
comprising a conditional promoter operatively associated 
with a recombinase coding sequence and a construct 
comprising a tissue-specific promoter operatively 
associated with a recombinase coding sequence. 

43. A method for the conditional assembly of 
functional gene(s) for expression in eukaryotic cells by 
recombination of individual inactive gene segments from one 
or more gene(s) of interest, 

wherein each of said segments contains at least one 

recombination target site, and 

wherein at least one of said segments contains at 
least two recombination target sites, 

said method comprising: 

introducing said individual inactive gene 
segments into an embryonic stem cell according to 
claim 10, thereby providing a DNA which encodes a 
■ functional gene of interest, the expression product of 
which is biologically active, upon passage of the 
genome derived from said stem. cells through 
gametogenesis . 

44. A method for the generation of recombinant 
livestock, said method comprising: 
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combining embryonic stem cells that include a nucleic 
acid construct according to claim 1 with host 
pluripotential ES cells derived from early preimplantat ion 
embryos , and 

introducing these combined embryos into a host female 

and 

allowing the derived embryos to come to term. 

45. A method for the generation of recombinant 
plants, said method comprising transforming plant zygotes 
with nucleic acid constructs according to claim 1 and 
allowing the zygote to develop. 
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SEQUENCE LISTING 



<110> O'Gorman, Steve 
Wahl, Geoffrey 

<120> Site-Specific Germline Recombination 
Eukaryotes and Constructs Useful Therefor 



<130> Salk2190 

<150> 08/919, 501 
<151> 1997-08-28 

<160> 8 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 652 
<212> DNA 

<213> Mus musculus 



gtctagtaat gtccaacacc tccctcagt 

<210> 3 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<400> 3 

ctctgagcca gctcccggcc aagccagcac 

<210> 4 
<211> 1022 
<212> DNA 

<213> Artificial Sequence 




60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
652 



<210> 2 
<:211> 29 
<212> DNA 

<213> Artificial Sequence 



<400> 2 



29 




60 
120 
180 
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tggaaaatgc ttctgtccgt ttgccggtcg tgggcggcat ggtgcaagtg aataaccgga 240 

aatggtttcc cgcagaacct gaagatgttc gcgattatct tctatatctt caggcgcgcg 300 

gtctggcagt aaaaactatc cagcaacatt tgggccagct aaacatgctc catcgtcggc 360 

ccgggctgcc acgaccaagc gacagcaatg ctgtttcact ggttatgcgg cggatccgaa 420 

aagaaaacgt tgatgccggt gaacgtgcaa aacaggctct agcgttcgaa cgcactgatt 4 80 

tcgaccaggt tcgttcactc atggaaaata gcgatcgctg ccaggataca cgtaatctgg 54 0 

catttctggg gattgcttat aacaccctgt tacgtatagc cgaaattgcc aggatcaggg 600 

ttaaagatat ctcacgtact gacggtggga gaatgttaat ccatattggc agaacgaaaa 660 

cgctggttag caccgcaggt gtagagaagg cacttagcct gggggtaact aaactggtcg 720 

agcgatggat ttccgtctct ggtgtagctg atgatccgaa taactacctg ttttgccggg 780 
tcagaaaaaa tggtgttgcc gcgccatctg ccaccagcca gctaccaact cgcgccctgg 
aagggatttt tgaagcaact catcgattga tttacggcgc taaggatgac tctggtcaga 
gatacctggc ctggtctgga cacagtgccc gtgtcggagc cgcgcgagat atggcccgcg 

ctggagtttc aataccggag atcatgcaag ctggtggctg gaccaatgca aatattgtca 1020 
tg 



840 
900 
960 



1022 



<210> 5 
<211> 2293 
<212> DNA 

<213> Artificial Sequence 
<400> S 

gtctagtaat gtccaacacc tccctcagtc caaacactgc tctgcatcca tgtggctccc 
atttatacct gaagcactcg atggggcctc aatgtttcac tagagcccac ccccctgcaa 120 
ctctgagacc ctctggattt gtctgtcagt gcctcactgg ggcgttggat aatttcttaa 180 
aaggtcaagt tccctcagca gcattctctg agcagtctga agatgtgtgc tttcacagtt 
acaaatccat gtggctgttt cacccacctg cctggccttg ggttatctat caggacctag 
cctagaagca ggtgtgtggc acttaacacc taagctgagt gactaactga acactcaagt 
ggatgccatc tttgtcactt cttgactgtg acacaagcaa ctcctgatgc caaagccctg 
cccacccctc tcatgcccat atttggacat ggtacaggtc ctcactggcc atggtctgtg 
aggtcctggt cctctttgac ttcataattc ctaggggcca ctagtatcta taagaggaag 
agggtgctgg ctcccaggcc acagcccaca aaattccacc tgctcacagg ttggctggct 
cgacccaggt ggtgtcccct gctctgagcc agctcccggc caagccagca cccgggacca 
tggagcaaaa gctgatttct gaggaggatc tgggaggacc caagaagaag aggaaggtgt 
ccaatttact gaccgtacac caaaatttgc ctgcattacc ggtcgatgca acgagtgatg 
aggttcgcaa gaacctgatg gacatgttca gggatcgcca ggcgttttct gagcatacct 
ggaaaatgct tctgtccgtt tgccggtcgt gggcggcatg gtgcaagttg aataaccgga 900 
aatggtttcc cgcagaacct gaagatgttc gcgattatct tctatatctt caggcgcgcg QCr ^ 
gtctggcagt aaaaactatc cagcaacatt tgggccagct aaacatgctt catcgtcggt 
ccgggctgcc acgaccaagt gacagcaatg ctgtttcact ggttatgcgg cggatccgaa 
aagaaaacgt tgatgccggt gaacgtgcaa aacaggctct agcgttcgaa cgcactgatt 
tcgaccaggt tcgttcactc atggaaaata gcgatcgctg ccaggatata cgtaatctgg 1200 
catttctggg gattgcttat aacaccctgt tacgtatagc cgaaattgcc aggatcaggg 
ttaaagatat ctcacgtact gacggtggga gaatgttaat ccatattggc agaacgaaaa 
cgctggttag caccgcaggt gtagagaagg cacttagcct gggggtaact aaactggtcg 
agcgatggat ttccgtctct ggtgtagctg atgatccgaa taactacctg ttttgccggg 
tcagaaaaaa tggtgttgcc gcgccatctg ccaccagcca gctatcaact cgcgccctgg 1500 
aagggatttt tgaagcaact catcgattga tttacggcgc taaggatgac tctggtcaga 1560 
gatacctggc ctggtctgga cacagtgccc gtgtcggagc cgcgcgagat atggcccgcg 
ctggagtttc aataccggag atcatgcaag ctggtggctg gaccaatgta aatattgtca 
tgaactatat ccgtaacctg gatagtgaaa caggggcaat ggtgcgcctg ctggaagatg 
gcgattagcc attaacgcgt aaatgattgc tataattatt tgatatttat ggtgacatat 
gagaaaggat ttcaacatcg acggaaaata tgtagtgctg tctgtaagca ctaatattca 
gtcgccagcc gacattgtca ctgtaaagct gagcgataga atgcctgata ttgactcaat 
atccggtgcg tttcctgtca aaagtatgcg tagtgctgaa catttcgcga tgaatcccac 
cgaggaagaa gcacggcgcg gttttgctaa agtgatgtct gagtttggcg aactcttggg 
taaggttgga attgtcgagg ctgggtgtgg cggaccgcta tcaggacata gcgttggcta 
cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc gtgctttacg 
gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac gagttcttct 
gaggggatcg gcaataaaaa gacagaataa aacgcacggg tgttgggtcg tttgttcgga 
tcgatccgtc gac 



60 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



960 
1020 
1080 
1140 



1260 
1320 
1380 
1440 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2293 
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<210> 6 
<211> 86 
<212> DNA 

<213> Artificial Sequence 



<400> 6 

cccgggatca attcaccatg ggaataactt cgtatagcat acattatacg aagttatgga 60 

tccgccgcta tcaggacata gcgttg 86 

<210> 7 
<211> 4172 
<212> DNA 

<213> Artificial Sequence 
<400> 7 

gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60 

atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120 

agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180 

ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240 

gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 3 00 

gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360 

tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420 

acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 480 

aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 540 

cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600 

gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 660 

cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720 

tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 78 0 

tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 84 0 

ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900 

tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 96 0 

gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020 

ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080 

tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140 

agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200 

aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260 

cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320 

agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380 

tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 144 0 

gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500 

gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560 

ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 162 0 

gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 16 8 0 

ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 174 0 

ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800 

acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 1860 

gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 192 0 

cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980 

gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 204 0 

gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 2100 

gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 216 0 
agctcgaaat taaccctcac taaagggaac aaaagctggg tacgaattca gatctcccgg ■ 2220 

gatcaattca ccatgggaat aacttcgtat agcatacatt atacgaagtt atggatccgg 2280 

tcgagcagtg tggttttgca agaggaagca aaaagcctct ccacccaggc ctggaatgtt 234 0 

tccacccaat gtcgagcagt gtggttttgc aagaggaagc aaaaagcctc tccacccagg 2400 

cctggaatgt ttccacccaa tgtcgagcaa accccgccca gcgtcttgtc attggcgaat 2460 

tcgaacacgc agatgcagtc ggggcggcgc ggtcccaggt ccacttcgca tattaaggtg 2520 

acgcgtgtgg cctcgaacac cgagcgaccc tgcagccaat atgggatcgg ccattgaaca 2580 

agatggattg cacgcaggtt ctccggccgc ttgggtggag aggctattcg gctatgactg 2640 

ggcacaacag acaatcggct gctctgatgc cgccgtgttc cggctgtcag cgcaggggcg 2700 

cccggttctt tttgtcaaga ccgacctgtc cggtgccctg aatgaactgc aggacgaggc 2760 

agcgcggcta tcgtggctgg ccacgacggg cgttccttgc gcagctgtgc tcgacgttgt 2820 
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cactgaagcg ggaagggact ggctgctatt gggcgaagtg ccggggcagg atctcctgtc 2880 

atctcacctt gctcctgccg agaaagtatc catcatggcc gatgcaatgc ggcggctgca 2940 

tacgcttgat ccggctacct gcccattcga ccaccaagcg aaacatcgca tcgagcgagc 3000 

acgtactcgg atggaagccg gtcttgtcga tcaggatgat ctggacgaag agcatcaggg 3060 

gctcgcgcca gccgaactgt tcgccaggct caaggcgcgc atgcccgacg gcgaggatct 3120 

cgtcgtgacc catggcgatg cctgcttgcc gaatatcatg gtggaaaatg gccgcttttc 3180 

tggattcatc gactgtggcc ggctgggtgt ggcggaccgc tatcaggaca tagcgttggc 3240 

tacccgtgat attgctgaag agcttggcgg cgaatgggct gaccgcttcc tcgtgcttta 3300 

cggtatcgcc gctcccgatt cgcagcgcat cgccttctat cgccttcttg acgagttctt 3360 

ctgaggggat cggcaataaa aagacagaat aaaacgcacg ggtgttgggt cgtttgttcg 3420 

gatagggatc aattcaccat gggaataact tcgtatagca tacattatac gaagttatgg 34 80 

atccactagt tctagagcgg ccgccaccgc ggtggagctc caattcgccc tatagtgagt 3540 

cgtattacaa ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 3600 

cccaactcaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 3660 

cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 3720 

gcagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 3780 

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 3840 

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 3900 

ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 3960 

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ccttaatagt ggactcttgt 4020 

tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 4080 

tgccgatttc ggcctattgg ttaaaaaatg agctgattca acaaaaattt aacgcgaatt 4140 

ttaacaaaat attaacgctt acaatttagg tg 4172 

<210> 8 
<211> 34 
<212> DNA 

<213> Artificial Sequence 



<400> 8 

ataacttcgt atagcacaca ttatacgaag ttat 



34 



